Towards Reconfigurable Rack-Scale Networking
نویسندگان
چکیده
Datacenters have traditionally been designed and deployed as a collection of server racks, where each rack consists of approximately 42 units of servers connected to one or two edge switches. In this model, computation is generally localized within a single server, and the edge switches provide 10 or 40 Gbps of non-blocking bandwidth between servers in the same rack. Although the traditional model is easy to deploy, application writers are increasingly finding that a unit of computation does not conveniently fit within a single server. To address this issue, a new datacenter model, known as rackscale computing, has been proposed. In rack-scale computing, a rack is not a collection of servers, but is instead a single system consisting of multiple interconnected subsystems, such as CPU, memory, and storage, that are disaggregated into separate locations within a rack. Depending on the design, a subsystem can be made up of components of the same type, where a component is essentially a miniserver with an abundance of a particular resource, or it can be made up of just the raw resources with the network serving as a rack-scale system bus. In this paper, we will focus on subsystems that are made out of components. Through disaggregation, components can be upgraded independently, and datacenter operators can customize the balance between different subsystems without having to replace or upgrade every component. The key barrier to realizing subsystem disaggregation is the absence of commodity, high-speed rack-scale networks. Current 40 Gbps networks may not have sufficient bandwidth to satisfy the data requirements between different subsystems. Simply increasing the network’s bandwidth is not a sustainable solution because it places an extreme demand on the edge switches. For example, a 100 Gbps network connecting 100 different components in a rack-scale computer, where a component is a unit of CPU, memory, or storage, requires an edge switch with a switching throughput of 10 Tbps. Distributed switching or server-centric networks can address this problem, but they introduce additional switching latency that can significantly reduce the performance of latency-sensitive applications. In order to achieve the goals of rack-scale computing, we believe there is a critical need for reconfigurable rack-scale networks that can dynamically change their topology to provide high-bandwidth, direct connectivity between different components in response to changing workloads. Although there has been a significant body of work on reconfigurable networks [1, 2, 3], they have all approached the problem by reconfiguring the connectivity between switches to reduce or eliminate the impact of network oversubscription. We believe that, for rack-scale computers, a more effective approach is to have more edge switches, where each switch is responsible for only a small group of components, and to dynamically reconfigure the group membership of the switches, as illustrated in Figure 1. This approach provides Packet Switch Network
منابع مشابه
XFabric: A Reconfigurable In-Rack Network for Rack-Scale Computers
Rack-scale computers are dense clusters with hundreds of micro-servers per rack. Designed for data center workloads, they can have significant power, cost and performance benefits over current racks. The rack network can be distributed, with small packet switches embedded on each processor as part of a system-on-chip (SoC) design. Ingress/egress traffic is forwarded by SoCs that have direct upl...
متن کاملR2C2: A Network Stack for Rack-scale Computers – Public Review
For modern Internet data centers, a key challenge to meeting the immense compute, storage, and networking needs of next-generation applications is the ability of the underlying infrastructure to scale. An important new trend in this direction is the introduction of socalled “rack scale” computers, which are large numbers of tightly integrated systems-on-chip (SoC) processors, interconnected wit...
متن کاملSimple and Cheap Theia: ˇ Networking for Ultra-Dense Data Centers
Recent trends to pack data centers with more CPUs per rack have led to a scenario in which each individual rack may contain hundreds, or even thousands, of compute nodes using system-on-chip (SoC) architectures. At this increased scale, traditional rack-level star topologies with a top-ofrack (ToR) switch as the hub and servers as the leaves are no longer feasible in terms of monetary cost, phy...
متن کاملQuery processing for datacenter-scale computers
Quickly exploring massive datasets for insights requires an efficient data processing platform. Parallel database management systems were originally designed to scale only to a handful of nodes, where each node keeps recent (“hot”) data in memory and has directlyattached hard disk storage for infrequently accessed (“cold”) data. To keep pace with the growing data volumes, the research focus has...
متن کامل